A NOTE ON THE LARGE DEVIATIONS FOR PIECEWISE 
EXPANDING MULTIDIMENSIONAL MAPS 



R. AIMINO AND S. VAIENTI 

Abstract. We provide the large deviation principle for higher dimensional piece- 
wise expanding maps and by using the functional approach of Hennion and Herve, 
slightly modified. 



1. Introduction 

There are different ways to establish large deviation principles (LDP) for dynamical 
systems. One of them is the so-called "Laplace Method", which relies on the spectral 
properties of the Perron-Frobenius, or transfer, operator. This strategy has been 
developed in a very general and abstract setting by Hennion and Herve in [T5] . 
They assume that the transfer operator acts on a Banach spaces of measurable 
functions, and it is quasi-compact on it, i.e. it has a spectral gap. The existence of an 
invariant probability measure follows immediately, and, by using perturbation theory 
for linear operators, they derive a few others statistical properties. This approach 
covers a lot of systems, for instance expanding maps of the interval [151 XII. 1], 
Gibbs measures for subshift of finite type [151 XII. 2], and expanding Young towers 
[29] . Nevertheless, this theory seems inappropriate for expanding (discontinuous) 
maps in higher dimension, like those treated by several authors [D El El El El El ED 
E21 ESI ESI EH EH]- Indeed, Hennion and Herve assume that the Banach space on 
which the transfer operator acts, consists of measurable functions defined everywhere, 
but for the higher dimensional systems quoted above, the functional spaces usually 
considered (bounded variation, quasi-Holder or Sobolev spaces) consist of classes 
of equivalence of functions modulo the reference measure, and hence they are only 
defined almost everywhere. Furthermore, in [15], the Dirac masses must belong to 
the topological dual of the Banach space, so this theory cannot be applied directly 
for those systems. 

Nevertheless it appears that one could slightly modify the proofs from [15J in order 
to deal with a Banach space consisting of classes of functions. In particular we will 
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do it for the functional space of quasi-Holder functions mostly investigated in [25] 
and which verify an additional algebraic assumption which also plays a role in the 
Hennion-Herve approach. As a consequence we will get the large deviations principle 
for such systems and, as far as we know, this result is not present in the literature. 
We also prove the central limit theorem, but for the latter one already disposes of 
the Gordin-Liverani theorems [T3J 120] • 

Actually a weaker result for the large deviations of systems like those considered 
above has been recently obtained in [2]. We will comment about the difference with 
the spectral technique presented in this note in the Remark 2 below. We anticipate 
here that the paper [2] furnishes an upper bound for the deviation functions and 
whenever the correlation functions involving L l observables decay to zero with a 
summable rate. In order to check these assumptions for our systems we should 
further require that the density of the invariant measure is essentially bounded from 
below, but this assumption is not necessary in the spectral approach discusses later 
on. 

2. Assumptions and statement of the results 

We now give the precise assumptions under which the LDP is valid. Let (X, A, m) 
a probability space, and T:I->Ia measurable transformation, non singular with 
respect to m. Under these conditions, the Perron- Frobenius operator P : L 1 (m) — > 
L l (m) is well defined by Pf = -4^, where rn,f(A) = J T -i A f dm is absolutely continu- 
ous with respect to m. We stress here the fact that the functions under consideration 
are complex valued, as required by the spectral theory we are going to use below. 
The transfer operator enjoys some classical properties that we resume below; see [I] 
or [IS] for more details. 

(1) Linearity : P is a linear operator on L 1 (m), satisfying ||P/||i < ||/||i for all 
./> /-'('"): 

(2) Positivity : For all / G L 1 (m) such that / > m-ae, we have Pf > m-ae; 

(3) Preservation of integrals : For all / G L 1 (m), we have J Pf dm = J f dm; 

(4) Duality : For all / G L 1 (m) and g G L°°(m), we have J f(g o T) dm = 
f{Pf)gdm- 

(5) Invariant Measures : / G L 1 (m) is the density of a T-invariant probability if 
and only if / > 0, J f dm = 1 and Pf = f. 

Let us suppose now that we have a subspace B C L 1 (m), equipped with a norm 
|| . ||e such that 

(1) (£>, || . ||s) is a complex Banach space with continuous injection B — > L 1 (m); 

(2) Constant functions lie in B; 
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(3) B is a Banach algebra : there exists C > such that for all /, g G B we have 
fgeB with \\fg\\ B <C\\f\\ B \\g\\ B] 

(4) B is a complex Banach lattice : for every / G B, we have /, |/| G £>; 

(5) B is stable under P : P(B) C S; 

(6) P is a bounded operator on B, with spectral radius equal to one; 

(7) P is quasi-compact of diagonal type on B. 

The last assertion means that there exists a decomposition 

s 

p = J2^ + Q 

1=1 

where A, are complex numbers of modulus 1, IT are finite-rank projections satisfying 
rijllj = when % ^ j and Q is a bounded operator on B with spectral radius strictly 
less than 1 and satisfying QUi = HiQ = for all i. The spectrum of P consists then 
of a finite number of eigenvalues of modulus 1, with finite multiplicity, and the rest 
of the spectrum lies in a disc centered at with radius strictly less than 1. When 
B is compactly injected in L 1 (m), this can be deduced from a Lasota-Yorke type 
inequality, by means of the Ionescu-Tulcea and Marinescu theorem [161 EH] • See [15] 
for precise definitions and results about quasi-compactness. 

Under those conditions, the existence of an T-invariant probability fi absolutely 
continuous w.r.t m, such that G B is a classical result : for every / G B such that 
/ > 0, f f dm = 1, and so in particular for / = 1, quasi-compactness implies that 
the sequence - J2k=o P k f converges in B to a function /* such that the measure [if 
with |J = /* 

is an acip. Furthermore, 1 is an eigenvalue of P. If we assume that 
1 is a simple eigenvalue of P, then there exists an unique acip [i such that G B. 
From now, we will always assume that 1 is a simple eigenvalue, and that there is no 
other eigenvalue of modulus 1. \i will denote the unique acip, and v G B its density. 
We then have 0, for all n > 1 and / G B 

P n f =<m,f>v + Q n f 

As a consequence, we get exponential decay of correlation : there exists C > and 
< A < 1 such that for every / G B and every g G L°°(/x) we have 



J f(9°T n )djj- J fdfx J gd/j, 



<C7A b ||/|| b |M|l- 



When ip £ B* belongs to the topological dual of B, we denote < <p,f >— ¥?(/)• The linear form 
/ — > J f dm belongs to B*, and we denote it by to. 
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Let now : X — > R a bounded observable which lie in B, with zero mean J §d\i = 
0. Denote by S n the Birkhoff sums : 

n-1 
k=0 

We are now able to state the LDP : 
Theorem 1. (Large Deviation Principle) 

Under the above conditions, the limit a 2 = lim^oo J(-^) 2 d\i exists, and if a 2 > 0, 

then there exists for some eo > a rate function c :] — eo, +eo[— > R, continuous, 
strictly convex, vanishing only at 0, such that for every < e < eo and every proba- 
bility measure v with v <^m and ^- G B, we have 

lim — log v(S n > ne) = — c(e) 

n— s>oo n 

As an easy consequence of the techniques introduced in the next section, we also 
get the central limit theorem. We denote with jV(0, a 2 ) the Dirac mass Sq if o 2 = 0, 

and the probability with density ]— e 57* with respect to Lebesgue if o 2 > 0. 



Theorem 2. (Central Limit Theorem) 

Under the same assumptions of Theorem 1, ^fe converges in distribution to Af(0, o 2 

in the probability space (X, A, v) for every probability v with v m and 4^ G B 
for every bounded continuous function g : R — > R, we have 



lim f g(^)du= [ gdj\f(0, a 2 ) 
n ^°° J V n J 



Remark 3. 

(1) Theorems 1 and 2 apply in particular for v = m and v = //, so the LDP and 
the CLT are valid for both reference and invariant measures. 

(2) As we anticipated in the Introduction, the paper [2] gives an upper bound 
for the large deviation function and under related assumptions. In particular 
Th. E in [2] states the following, with our notations. Let us suppose that 
T preserves an ergodic probability measure \i\ then let B C L l (fi), (ft G B, 
and assume that there exists £ (n) with Y^=o £ ( n ) < 00 such that for all ip G 
L 1 ( / u) we have \J(f)('ijjoT n )dfi — J(f)dfiJ'i/jdfi\ < £(^)||/||b||5 , ||l2°- Then 
there exists r = r(<f>) > and, for every e > 0, there exists C = C((f>, e) > 
such that fi(S n > ne) < Ce~ Tn . 

The proof of this result relies on a martingale approximation. Section C 
in [2] provides examples of systems for which one gets exponential decay of 
correlations (and hence £(n) is summable) and against if) G L l (fi): this last 
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assumption requires the density to be bounded from below. In conclusion 
the result of our note extends the previous one in two directions : first, we 
obtain a LDP, and not only an upper bound; secondly, we do not have to 
assume anything about the density, but the fact that it is in the L l norm 
with respect to the conformal (reference) measure. 



3. Proofs 

We begin by observing that the existence of the variance a 2 results from a straight- 
forward computation : since the sequence f 0(0 o T n ) d\i decays exponentially fast, 
it is absolutely summable, and then we see, by expanding the term S 2 , that the limit 
a 2 = lim^oo f (-^) 2 dfi exists and we have 

+oo 



2 dfi + 2 / <K0 ° Tn ) dfi 

71=1 ^ 



We assume from now o 2 > 0. Our proof of the LDP follows closely [15] except for 
a minor modification, which will be mentioned later. The same approach had been 
employed in [23]. Let / G B the density of the measure v with respect to m. We 
will apply Gartner-Ellis theorem [TU1 [T2] , so we are interested in the convergence of 
the sequence - log J e eSn f dm for 9 G R small enough. We introduce the "Laplace 
transform" operators P z , for z6C, defined by 

P z (f) = P(e*+f), feB 

Assuming for a moment that P z is well defined, we see immediately that we have 
J e n f dm = J Pg (f) dm. In order to prove that P z is a bounded operator on B, we 

just have to check that e z< ^ G B. Since B is a Banach algebra, the sequence Y^=o ^kl~ 
converges in B, and hence in L l (m). On the other hand, this sequence converges 
uniformly, and hence in L 1 (m), to e z< ^, and so we get that 

+ 00 / , N „ 



71=0 



in B. It also proves that the map z — > P z is holomorphic and we have the expansion 

n=0 

where C n (f) = P(0"/). 

We can now apply perturbation theory for linear operator to prove the following 
result. The proof relies on analytic functions of operators, see [UJ, or on the implicit 
function theorem, see [15] . For 9 > 0, we denote 3 e = {z G C / \z\ < 9}. 
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Proposition 4. There exist 6q > 0, C > 0, 771,772 > and holomorphic functions 
A(.) : B 0O ->■ C, v(.) : D flo -> B, <p{.) : D flo ->■ B* and Q(.) : D flo -> suc/i toat /or 

all z G Bg 

(I) A(0) = 1, v(0) = v, <p(0) = m, Q(0) = Q; 

(II) P z (f) = AO) < <p(z), f > v(z) + Q(z)f for all f G B; 
(Hi) < ip(z),v(z) >= 1; 

(iv) Q(z)v(z) = and ip(z)Q(z) = 0; 

(v) \X(z)\>l-r ]l ; 

(vi) \\Q{z) n \\<C{l-r ]l -r] 2 ) n . 

So, for all n > 1, we have 

PIV) = \{z) n < V (z), f > v(z) + Q(z) n f 

We can say much more on eigenvalues and eigenvectors when z = 9 is real. At this 
point, we need to show that for every positive function / e B with / ^ 0, there 
exists a positive linear form ip e £>* such that < </?, / > > 0. In the context of [15J, 
since functions are defined everywhere, there exists x G X such that f(x) > 0, and 
so the Dirac mass 5 X does the job. In our context, Dirac masses are not available, 
but the reference measure is usable , since necessarily < m, f » 0, otherwise, 
/ would be m-ae, and so / = in B. This was not the case in [15] because 
they consider functions defined everywhere, and not classes of equivalence. We can 
also use arguments from complex Banach lattice theory |22j : a modification of the 
Hahn-Banach theorem shows that there exists a positive bounded linear form (p on 
Br = {/ G B I f{x) G M m— ae}, such that < (p, f >= 1, and then we can extend 
it on all B. This argument could be employed in more abstract contexts, where the 
Banach space B consists of distributions-like objects and when we don't have a good 
knowledge of its topological dual. 

Proposition 5. There exists < Q\ < 6 such that for every 9 G M with \9\ < 1; 
we have \{6) > 0. Furthermore, v(.) and <p(.) can be redefined such that v{6) > 0, 
(p(0) > 0. 

Proof. As Pq is a real operator, we have Pgf = Pgf for all / G B. So, we have 
Pgv(6) = Pgv(6) = X(6) v{9). Since X(6) is the unique eigenvalue of Pg with maximal 
modulus, we get X(8) = X(9), and hence X(8) G R. Since A(0) = 1, by a continuity 
argument, we obtain X(9) > for small 9. For z G C small enough, < <p(z), t >^ 0. 
We define v(z) =< ip(z),t > v(z) and <p(z) =< <p(z),t > _1 <fi(z). Those new 
eigenfunctions satisfy obviously the conclusions of the previous proposition. We have 
just to prove that v{9) and (p{9) are positive for 9 G M small enough. By the spectral 
decomposition of Pg, we see that X(9)~ n Pgt goes to v{9) in B, and hence in L 1 (m). 
We then get v{9) > because Pg is a positive operator and X{9) is positive too. Now, 
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let $(0) G B* positive such that < i/)(6),v(6) >= 1. Then, \{9)- n {P*) n ^{9) goes to 
< ip(9),v(9) > <f(9) = <f (9), which proves that if {9) is a positive linear form. □ 

We denote 

A(0) = logA(0) 

We then have 

Proposition 6. There exists < 92 < 9\ such that for every 9 e B with \9\ < 92 
and every f 6 B with f > and J f dm = 1, we have 

lim - log / e 8Sn fdm = A(9) 

n-s>oo n J 

Proof. We have the identity 

es -fdm =< m, P r e \f) > = \{9) n < if {9), f >< m, v{9) > + < m, Q(9) n f > 

= X(9) n (< f (9), f><m, v{9) > +X(9)- n < m, Q{9) n f >) 
All involved quantities are positive, hence we can write 

-log I e eSn fdm = logA(#) + -log(< ip(9), f >< m, v{9) > +A(0)~ n < m,Q(9) n f >) 
n J n 

Since 

lim < ip {9), f ><m, v{9) >= 1 

and since the spectral radius of Q{9) is strictly less than X(9), it's easy to see that 
for 9 small enough, we have 

lim - log(< <f (9), fxm, v{9) > +\(9)- n < m, Q{9) n f >) = 

□ 

In order to apply Gartner-Ellis theorem, we just have to show that A is differen- 
tiable function, strictly convex in a neighborhood of 0. Since A is real-analytic, A 
is too. Computations from perturbation theory @ show that A'(0) = J <f>d\x = and 

A"(0) = cr 2 , so we have A"(0) = A ^^tq^ ~ = cr 2 > and we can now apply the 
following local version of Gartner-Ellis theorem, whose proof can be found in lemma 
XIII.2 in H5] : 



2 See corollaries III. 11 and III. 6 in [15] . 
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Proposition 7. For all n > 1, denote by P n a probability measure on some mea- 
surable space (Q,T), by E n £/ie corresponding expectation operator and by S n a real 
valued random variable. Assume that on some interval [—0\,8a], 9\ > 0, we have 

lim ilo g E n [exp(0£ n )] = A(0), 

n—too fl 

where A is a strictly convex continuously differentiable function satisfying A'(0) = 0. 

(0a) ^ n r -:- A(dj 
e A * u > fc - e A 



Define e+ = ^ > 0, e_ = ^ < and c(e) = sup {6e - A(0)}. Then c is a 

\9\<9 A 

positive function, strictly convex on [e_,e + ], continuous, vanishing only at 0, and, 
for all < e < eo = e + , we have 

lim -logP n (5„ > ne) = -c(e) 

n— >-oo J7, 

We now prove our Theorem 2. 

Central Limit Theorem. By Levy's continuity theorem, it suffices to show that for 
all t e R 

lim / e v^/ dm = e 2 

We have 

/ e^/dm =< m,P^ (/) >= A(-^) n < ¥>(40,/ > < m,v(4=) > + < ™, Q(4o 
We just have to prove that 

if *2„ 2 

lim A(^=) n = e'-^- 
'n 



But the Taylor's expansion says that in a complex neighborhood of 

\{z) = A(0) + A'(0)z + + z\(z) = 1 + ^ + z 2 7/(z) 

where lirn^o T/ 1 ^) = 0. Then, a standard computation concludes the proof. □ 

4. Application to uniformly expanding maps 

The main application of our Theorem 1 will be to multidimensional piecewise 
uniformly expanding maps, in particular when we equip them with the space of the 
quasi- Holder functions. This space, introduced by Keller [17], developed by Blank [3] 
and successfully applied by Saussol [25] and successively by Buzzi [7J (see also [8]) 
and Tsujii [28], reveals to be very useful to control the oscillations of a function under 
the iteration of the transfer operator across the discontinuities of the map. Moreover 
it verifies the algebraic assumption 3 in Section 2 above; it is not straightforward 
to replace this condition and in order to fit with the Hennion-Herve theory, if one 
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uses the more conventional spaces of bounded variation functions (see for instance 
PQ El HI [2TJ 123) or the Sobolev spaces [27], and this topic deserves to be investigated 
in the future □. 

Let us now recall the precise definitions of our system by following closely the 
assumptions imposed in [25]. Let M C M. d be a compact subset with intM = M 
and piecewise C 1 boundary. We denote by d the Euclidean distance and by m the 
Lebesgue measure on M. d . We can assume without loss of generality that m(M) = 1. 
For A C M and e > 0, we denote B e (A) = {x G R d / d(x, A) < e}. Let T : M -> M 
a measurable application, and suppose there exists < a < 1 such that for some 
small enough eo we have : 

(1) There are finitely many disjoint open sets [/, C M with m(M \ Uj£/j) = 
such that for each i, Ti := T\u i — > M is C l+a and can be extended on a 
neighborhood V- of U { to a map r< : -»- M ' such that B eo (TiUi) C 
Tj(Vi). Moreover, each Tj : — )■ lR d is injective with C 1+a inverse; 

(2) There exists c > such that for any i, and any x, y G T(Ui) with <i(a;, < e 
we have 

IdetDTT 1 ^) -detDi;- 1 ^)! < c| det D^ 1 ^)!^, 

(3) There exists s(T) < 1 such that 

sup sup H-DXp 1 ^)!! < s(T); 

(4) Boundaries of C/j are piecewise C 1 codimension one embedded compact sub- 
manifolds and we have f]o{T) < 1 where 

m(T) = s(Tr + -^^Y(T)^ 
1 - s(T) 7 d 

Y(T) = sup (j{smooth pieces intersecting dUi and containing x} 



and 7^ = is the ci-volume of the <i-dimensional unit ball of M. d . 

The last condition can be greatly weakened, but the condition in [25] is of a very 
abstract nature, and it's more easy to handle with this one when the boundaries of 
the U~i are smooth. We define then the functional space on which acts the transfer 



3 In fact, if we check the proof, we only need the fact that B is Banach algebra and (f> G B to 
prove that the operators P z are well defined and holomorphic in z. So we can suppose the weaker 
assumption that <p is such that P z define a holomorphic family of bounded operators on B for z in 
a complex neighbourhood of 0. 
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operator. Let / G L x (]R d ). If A C M. d is a Borel subset, we define the oscillation of / 
over A by 

osc(/, A) = ess sup \f{x{) - f(x 2 )\ 

Xl,X'2£A 

where the essential supremum is taken with respect to the product measure m x m 
on A x A. We get a lower semi- continuous and hence measurable function x — > 
osc(/, B e (x)). We set 

\f\a = sup / osc(f,B e (x))dx 

0<e<e o e </R d 

We define 

V a (R d ) = {feL l (R d )/\f\ a <oo} 

and 

K(M) = {fe V a {R d ) I supp /cM} 

both endowed with the norm ||/|| Q = 1 1/11/4 + |/|«- Adapting proofs from [17], we 
can show that V a (M) is Banach space, with compact injection in L 1 (M). It's proven 
in [25] that V a (M) is also a Banach algebra, and it's obviously a Banach lattice. So, 
if we want to apply our previous results to those maps, we are left to prove that the 
transfer operator for T acts on V a (M) and is quasi-compact of diagonal type. But 
Saussol proved in this context a Lasota-Yorke inequality (lemma 4.1 in [23]) which 
implies the quasi-compactness of the Perron- Frobenius operator. Hence, assuming 
that the system is mixing, we get the central limit theorem and the large deviations 
principle for bounded real observables in V a (M). 
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